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REMARKS 

The present reply is responsive to the Office Action 
mailed July 14, 2004. Claims 15, 26 and 31 have been amended. 
No new matter has been added by these amendments. Claims 1-34 
are again presented for the Examiner's consideration in view of 
the following comments. A substitute specification is provided 
herewith . 

The Examiner has objected to the abstract for 
inadvertently including a file identifier. A new abstract is 
submitted herewith on a separate sheet. Therefore, applicant 
respectfully requests that the objection to the abstract be 
withdrawn . 

The Examiner also objected to the title by asserting 
that it is "not descriptive." (Official Action, 5 3 at 4.) 
Applicant respectfully disagrees, . The title of the instant 
application is "System and Method for Processing Complex 
Computer Instructions." As described in the background of the 
invention, "[c]omputers routinely perform complex tasks by 
breaking a single, complex task into a series of simpler 
instructions that the computer understands." (Specification 
5 0001.) Aspects of the claimed invention are directed to "a 
system and method which is capable of quickly emulating complex 
instructions." {Id., 5 0006.) Furthermore, independent claim 1 
recites "[a] system for processing a computer instruction from a 
source of such instructions," which includes "a complex 
instruction detector." Independent claim 15 recites "[a] method 
of processing a computer instruction comprising" including 
"determining whether the computer instruction is complex." The 
title not only refers to_ the general nature _of the , invention 
described, but also refers to specific features of the claimed 
invention. Therefore, applicant respectfully requests that the 
objection to the title be withdrawn. 
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The Examiner also objected to the drawings under 37 
C.F.R. § 1.83(a). In particular, the Examiner objected to the 
figures in view of claims 6-14, 16-19, 22, 27-29 and 33-34. New 
figures 3(a), 4, 5 (a) -(c), 6 and 7 are attached herewith. No 
new matter has been added by the new drawings, as the features 
of the new drawings are fully supported by the original claims 
and specification as filed. Moreover, the drawings illustrate 
examples of the claims, and should not be deemed to limit the 
scope of the claims . 

Original figure 3 illustrates a flow diagram, 
including determination of an emulation code address in step 
202. As described in the specification, "[a]ddress generator 
102 then determines the address for the instruction based on the 
opcode (and potentially some or all of the operand) for the 
instruction word "DMULTU" (step 202)." Thus, it is believed 
that figure 3 addresses the Examiner's objection of the drawings 
in relation to claims 13, 14 and 17. 

Therefore, as each and every objection to the drawings 
has been addressed either in the original figures or in the new 
figures, applicant respectfully requests that the objections to 
the drawings be withdrawn. 

The Examiner rejected claims 1-2, 4-5, 9-11, 15-23, 
26-31 and 33-34 under 35 U.S.C. § 102(b) as being anticipated by 
U.S. Patent No. 5, 781, 750 to Blomgren et ai. Blomgren" ) , 
Applicant respectfully traverses the rejection. 

Blomgren teaches a CPU for handling two distinct 
instruction sets, CISC and RISC, When an extended RISC 
instruction is present, the system enters an emulation mode, 
wherein "complex instructions can be detected by the hardware 
and trapped to a software emulation driver. Thus some of the 
complexity of the CISC architecture is moved to the software 
driver. However, the emulation driver must be isolated and 
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hidden from the user's code being executed, otherwise the user 
programs could^ modify or destroy the emulation code, resulting 
in a system crash." (Col. 5, Ins. 17-23,) When in emulation 
mode, " [a] software routine will be executed that breaks the 
complex CISC instruction down into several smaller RISC 
instructions . " ( Id. , Ins . 58-61 . ) 

A mode register "contains bits to indicate the current 
operating mode of the CPU. One bit selects between the RISC and 
CISC user modes, while another bit enables the extended " RISC 
instructions for emulation mode." (Col. 6, Ins. 28-32.) A 
multiplexer "selects the decode instruction from either the RISC 
or the CISC decode sub-block. Mux 46 is controlled by the 
RISC/CISC mode control bit in mode register 38." (Id., 
Ins . 53-56 . ) 

Each of independent claims 1, 23 and 26 requires 
determining whether an instruction is a member of a set of 
instructions (e.g., a member of a set of complex instructions). 
Claim 1 recites "the output being indicative of whether the 
instruction is a member of a set of instructions." Claim 23 
recites "determining whether the original instruction is a 
member of the set of instructions." Claim 26 recites "output a 
value indicative of whether the instruction is a member of a set 
of instructions." Furthermore, claim 26 has been amended to 
recite "an address generator connected to the memory so as to 
receive computer instructions from the memory and to generate an 
address" in order to provide antecedent basis support for the 
term "address . " 

If a determination is made that the instruction is a 
member, "then"' an "address is extracted and a jump instruction 
executed. If a determination is made that the instruction is 
not a member, then another type of instruction is chosen. For 
example, claim 1 recites "the output comprising either the 



13 



Application No.: 09/935,878 



Docket No.: SCEI 3,0-157 



instruction from the source or the instruction from the jump 
instruction generator depending upon the output of the complex 
instruction detector." Claim 23 recites "selecting the jump and 
link instruction or the original instruction based on the result 
of the step of determining." Similarly, claim 26 recites 
"whereby depending on the value from the complex instruction 
detector, either the jump instruction or the computer 
instruction is provided by the instruction selector to the 
processor . " 

In each of these independent claims, a selection is 
made between an "original" instruction (e.g., the instruction 
from the source in claim 1, the original instruction in 
claim 23, or the computer instruction in claim 26) and a "jump" 
instruction (e.g., the instruction from the jump instruction 
generator of claim 1, the jump and linK instruction of claim 23, 
or the jump instruction of claim 26) . The "jump" instruction is 
generated based at least in part on the "original" instruction. 

In contrast, the multiplexer of Blomgren simply 
determines whether the CPU is operating in a RISC mode or a CISC 
mode without regard to whether the instruction is a member. The 
multiplexer does not select a jump instruction based on whether 
the newly-loaded instruction is a. member but rather based on the 
value of a bit flag common to the entire chip. 

Blomgren neither teaches nor suggests determining 
whether an instruction is a member of a set of instructions and 
selecting between an "original" instruction and a "jump" 
instruction based upon the determination. Therefore, because 
Blomgren lacks a disclosure or teaching of all of the elements 
"Of independent claims 1, 23 and 26, applicant respectfully 
requests reconsideration and allowance of these claims. 
Claims 2, 4-5, 9-11 and 27-30 depend from claims 1 and 26, 
respectively, and contain all of the limitations thereof as well 
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as other limitations that are neither disclosed nor suggested by 
the prior art of record. Accordingly, applicant submits that 
the dependent claims are likewise patentable. 

Independent claim 15 has been amended to require 
"wherein determining whether the computer instruction is complex 
is performed in parallel with the steps of generating the 
address and generating the jump instruction." Similarly, 
independent claim 31 has been amended to require "wherein the 
complex instruction detector operates in parallel with the 
address generator and the jump instruction generator." Support 
for these amendments can be found in paragraphs 0031-0035 of the 
specification and in FIGS. 2-3. 

Blomgren neither teaches nor suggests the parallel 
process of either claim 15 or of claim 31. Therefore, because 
Blomgren lacks a disclosure or teaching of all of the elements 
of independent claims 15 and 31, applicant respectfully requests 
reconsideration and allowance of these claims. Claims 16-22 and 
33-34 depend from claims 15 and 31, respectively, and contain 
all of the limitations thereof as well as other limitations that 
are neither disclosed nor suggested by the prior art of record. 
Accordingly, applicant submits that the dependent claims are 
likewise patentable. 

Claims 6-8 and 12-14 were rejected under 
35 U.S.C. § 103(a) as being obvious in view of Blomgren. 
Claims 3, 24-25 and 32 were rejected under 35 U.S.C. § 103(a) as 
being obvious over Blomgren in view of U.S. Patent No. 5,826,089 
to Jreton (" Jreton") . For at least the reasons discussed above, 
independent claims 1, 15, 23, 26 and 31 are patentable. 
"Claims 3, 6-8, 12-14, 24-25 and 32 depend from independent 
claims 1, 23 and 31, respectively, and contain all of the 
limitations thereof as well as other limitations that are 
neither disclosed nor suggested by the prior art of record. 
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Accordingly, applicant submits that the dependent claims are 
likewise patentable . 

As it is believed that all of the objections and 
rejections set forth in the Official Action have been fully met 
by the foregoing amendments and remarks, favorable 
reconsideration and allowance are earnestly solicited. If, 
however, for any reason the Examiner does not believe that such 
action can be taken at this time, it is respectfully requested 
that she telephone applicant's attorney at (908) 654-5000 in 
order to overcome any additional objections which she might 
have. 

If there are any additional charges in connection with 
this requested amendment, the Examiner is authorized to charge 
Deposit Account No. 12-1095 therefor. 

Dated: October 14, 2004 Re spec^Eu^^ 

Andrew T. Zidel 
Registration No.: 45,256 
LERNER, DAVID, LITTENBERG, 
KRUMHOLZ & MENTLIK, LLP 
600 South Avenue West 
Westfield, New Jersey 07090 
(908) 654-5000 
Attorney for Applicant 

520668_1.DOC 
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MARKED-UP COPY OF 
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%1 

PATENT 
SCEI 3.0-157 

SYSTEM AND METHOD FOR 
PROCESSING COMPLEX COMPUTER INSTRUCTIONS 



BACKGROUND OF THE INVENTION 

[0001] Coir.puters routinely pcrforrr. ccrriplex tasks by breaking a 
single, complex task into a series of simpler instructions 
that the computer understands. These instructions are 

commonly referred to as "machine instructions," meaning that 
they are instructions which a computer processor chip can 
understand. In fact, even the simplest task for a human may 
involve a complex set of many different machine instructions. 
For example, a typical computer-processing chip will need to 
perform a number of different steps just to count from 2 to 
10. A processor may: convert the number 2 into a binary- 
based value of one's and zero's and store the result in an 
area of memory known as a register; convert the number to 
increment by, i.e., the value of 1, to a binary-based value 
and store the result in another register; convert and store 
the value of 10 in yet another register; pass the binary 
values contained in the first two registers to circuits for 
adding two numbers together; store the sum back in the first 
register; check the new value in the first register against 
the value in the second register; and depending on the result 
of the comparison, repeat the process of adding and storing. 
[0002] In fact, machine instructions may be broken down into 
even smaller machine instructions. For example, many chips 
are based on the CISC architecture (Complex Instruction Set 
Computer) . In - CISC, machine instructions- can- range f rom 
relatively simple to rather complex. During operation, each 
machine instruction is first converted into one or more 
"microinstructions" and the computer chip then executes the 
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individual microinstructions. The conversion of machine, 
instructions into microinstructions takes up processing time. 
[0003] Accordingly, in order to increase processing speed, 
many computer chips now eliminate the conversion step. For 
example, as its name implies, RISC-based computer systems 
(Reduced Instruction Set Computer) tend to limit the machine 
instructions to a set of those instructions that can be 
immediately executed by the processor in a single tirrie cycle 
throughput (A RISC processor needs multiple cycles to execute 
a instruction through a pipeline, but the throughput of the 
execution is one clock cycle) . Thus, the delay in converting 
the machine instruction is avoided. 

[0004] However, despite the limited and simplified 
instructions, many processor chips need to emulate more 
complex instructions. In some systems, when the chip sees a 
complex instruction which it cannot execute in a single step, 
it jumps to an area of the chip which breaks the complex 
instruction down into individual machine instructions. For 
example, programs are typically stored as a list of machine 
instructions, and each machine instruction is stored at its 
own unique address. The processor generally executes the 
instructions in sequence unless instructed to jump to a 
different address. So, when the processor encounters a 
complex instruction, it may implement a Jump and Link 
instruction to the area of the chip containing machine 
instructions that emulate the complex instruction. 
Specifically, the Jump and Link instruction causes the 
processor to store the address of the current instruction and 
then jump to a new address. The processor then executes the 
machine instruction at the new address. The processor 
continues executing -those instructions until it encounters -an 
instruction to return to the place of the j ump . Upon 
returning, the processor picks up where it left off, i.e., the 
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instruction at the address following the original Jump and 
Link instruction. 

[0005] Complex instructions can be emulated using software or 
hardware. For example, some systems use no special hardware 
to implement complex instructions; their opcodes are treated 
as reserved instruction opcodes, which generate exception 
interrupts. In the interrupt procedure, the complex 
instruction opcode is examined to chsck if it is an illegal 
instruction code or to be emulated as a complex instruction. 
In other systems, the CPU has hardware to convert a complex 
instruction to multiple implemented simple instructions, such 
as microcodes. In these systems, the converting hardware may 
become quite complex. 

[0006] There is a need for a system and method which is 
capable of quickly emulating complex instructions. 
SUMMARY OF THE INVENTION 

[0007] The present invention is directed towards those needs, 
[0008] In one aspect, the invention provides a system and 
method for handling complex instructions including generating 
a jump and link instruction from an address which may be 
embedded in a computer instruction and selecting the jump 
instruction if the original instruction was complex or the 
original instruction if it was not. 

[0009] Another aspect of the invention provides a system for 
processing a computer instruction from a source of such 
instructions. The system includes: a complex instruction 
detector having an input and an output, the input accepting 
computer instructions from the source and the output being 
indicative of whether the instruction is a member of set of 
instructions; an address generator having an input and an 
-output, --the input accepting computer instructions- -having a 
value and the output comprising an address based on the value; 
a jump instruction generator having an input and an output, 
the input being in communication with the address generator 
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output and the output comprising an instruction to jump to the 
address from the address generator; and an instruction 
selector having inputs and an output ^ the inputs being in 
communication with the jump instruction generator, the source 
and the complex instruction detector, the output comprising 
the instruction from the source or the instruction from the 
jump instruction generator depending upon the output of the 

[0010] Yet another aspect provides a method of processing a 
computer instruction. The method includes generating an 
address from the value associated with the computer 
instruction; generating a jump instruction based on said 
address; determining whether the computer instruction is 
complex; and selecting the jump instruction or the computer 
instruction . 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0011] FIGURE 1 is a schematic diagram of portions of a prior 
art system. 

[0012] FIGURE 2 is a schematic diagram of one aspect of the 
invention. 

[0013] FIGURE .3 is a flow chart of one aspect of the 
invention . 

[0014] FIGURE 3(a) is a flow chart of one aspect of the 
invention . 

[0015] FIGURE 4 is a schematic diagram of one aspect of the 
present invention . 

[0016] FIGURES 5 (a) -(c) illustrate a machine instruction and 
opcode processing in accordance with one aspect of the present 
invention . 

[0017] FIGURE 6 illustrates a set of instruction in accordance 

with one aspect of the present invention. -- 

[0018] FIGURE 7 illustrates addressing and instructions in 
accordance with one aspect of the present invention. 
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DETAILED DESCRIPTION OF THE EMBODIMENTS 

[0014] [0019] A prior art processing system is shown in 
FIGURE 1. Although the invention is not limited to use in 
such a system, it shall be used as an example for the purpose 
df illustration only. Memory 51 is a source of machine 
instructions which are to be executed by the processor. 
Memory 51 may be physically located any number of different 
places, such as by v;ay of example only an LI cache which is 
built into the chip containing the processor's circuits, an L2 
cache which is located externally to the chip, or some other 
storage device or medium. The machine instructions (also, 
"instruction code") are received from memory 51 and placed in 
instruction code cache 52. Because it takes time to get 
instructions from memory 51, the instruction code cache is 
used to hold a certain number of instructions so they can be 
quickly accessed and executed by the processor. 

[0015] [0020] A system 100 in accordance with the present 
invention is shown in FIGURE 2. Rather than passing the 
instruction code directly from memory 51 into the instruction 
cache memory 52, the instruction code is passed through system 
100, System 100 includes an address generator 102, jump 
instruction generator 104, complex instruction detector 106 
and instruction selector 108, all of which are in direct or 
indirect communication with one another. The address 

generator 102, the jump instruction generator 104, the complex 
instruction detector 106 and the instruction selector 108 may 
be located on a single ■ computer chip 120 (e.g., a RISC-based 
chip), as seen in FIGURE 4. The memory 51 may be an on-chip 
memory 51a or an off-chip memory 51b, 

[0016] [0021] One of the functions performed by the Complex 
Instruction- Detector 10-6- is determining whether an- instruction 
is complex. Complex instruction detector 106 accepts the 

instruction codes from memory 51 as input and determines 
whether the instruction is complex, i.e., whether it is a 
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member of the set of instructions which the processor can 
execute. In other words, it determines whether the 

instruction can be executed by the CPU hardware directly 
instead of needing to be implemented by executing multiple 
instructions. The output of the complex instruction detector 
is preferably, but not necessarily, a single bit which 
indicates whether the instruction is or is not complex. As 
s c G n in F I GU-RE 6 , t he go t of i n s t rue t i on 3 GOO ma y i n cl ud £ a 
subset 602 that includes instructions which are capable of 
being executed by the processor without additional decoding. 
The set of instructions 600 may also include a subset 604 that 
includes instructions which are capable of being executed by 
the processor in a single cycle throughput. 

[0017] [0022] As seen in FIGURE 5(a), Machine machine 

instructions 130 typically have two components: the operation 
code (also, "opcode") 132 and the operand 134 . These two 
components are then combined to form a single digital value. 
The operation code 132 basically tells the computer the 
operation to be performed, such as jumping to an address, 
adding two numbers or storing a value. Each operation code 
corresponds with a different binary value. For example, the 
Jump and Link opcode 132 m ay be represented by the value "0000 
11" whereas the storing function may be represented by the 
value "1010 00". The operand 132 contains the data to be used 
by the operation. For example, the operand 134 may represent 
a target address having the value "10 1111 0000 1010 1111 0000 
0001." Thus, a Jump and Link instruction to that address 
would be represented by a combination of the opcode and the 
operand values, e.g., "0000 1110 1111 0000 1010 1111 0000 
0001. " 

[0G1 8 ]- [0023] In one aspect of the present -invention, the 
complex instructions detector 106 uses the reserved 
instruction exception to determine whether the opcode 
represents a complex instruction. In typical CPU's, the 
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reserved instruction exception is used to prevent execution of 
unimplemented or illegal instructions on the CPU hardware. 
The exception usually occurs when the processor executes data 
as instructions accidentally. Accordingly, complex 

instruction detector 106 may, but not necessarily, use the 
logic ' associated with generating reserved instruction 
exceptions to generate a signal indicative of complexity. For 
exaiT^.ple, as seen in FIGURE 4, the complex inct ruction detector 
may include a reserved instruction exception handler 112. 
[0019] [0024] If the instruction is complex, then the present 
invention replaces the complex instruction with a jump and 
link instruction to an area of the chip ( as seen in FIGURE 4, 
e.g., memory 51a if the memory is contained on the chip or 
memory 51b if the memory is not contained on the chip) that 
emulates the complex instruction with individual machine 
instructions , 

[0020] [0025] One of the purposes of address generator 102 is 
to determine where the emulation code resides for particular 
complex instructions. The address generator 102 accepts 
instruction codes from memory 51 as input. Address generator 
102 then determines the address of the emulation code if the 
instruction is complex. The output is supplied to jump 
instruction generator 104. 

[0021] [0026] The address generator is not limited to any 
particular manner of translating opcodes into memory 
addresses. For example, it may include a Programmable Logic 
Array (PLA) 114 whereby the input comprises an opcode and the 
output equals the address . For example, as seen in FIGURE 
5 (b) , the PLA 114 may specify that a complex instruction 
opcode 132 of 101010 always jump to emulation code at an 
address 136-, for, instance - address -0x1234-567 - or the like. - For 
instance, as seen in FIGURE 7, address 700 may point to other 
instructions 702 and 704 which are capable of execution by the 
processor and which emulate the computer instruction. ^A 
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different complex instruction may jump to code at a different 
address. These addresses may be anywhere in the memory 

space, on or off of the chip. If the emulation code is stored 
on a chip, it may be quickly accessed but the size becomes 
more limited and not necessarily re-programmable. If the 
emulation code is stored in a main memory, it may be 
relatively slower but also more flexible. 

r Q.ng [0-02 7 ] Juinp instruction g'snerator 10^ generates a juinp 
instruction using the address generated by address generator 
102. For example, regardless of whether the original 

instruction was complex or not, it appends the opcode for Jump 
and Link to the address from address generator 102. 
[0023] [0028] Instruction selector 108 selects either the 
instruction as unmodified from memory 51 or the Jump and Link 
instruction created by jump instruction generator 104. The 
instruction selector makes this choice based on the output 
value of complex instruction detector 106, By way of example, 
if the output value of the complex instruction detector 106 is 
1, then instruction selector 108 will output the instruction 
received from jump instruction generator 104 to instruction 
cache memory 52. Otherwise, if the output value of the 
complex instruction detector is 0, then instruction selector 
108 will output ■ the instruction received from memory 51 to 
instruction cache memory 52; in other words, it is passed 
unmodified to the instruction cache memory 52. Preferably, 
if the instruction is not complex but rather, illegal, the 
output of the instruction selector 108 will indicate that the 
instruction is not complex. Accordingly, rather than being 
emulated, illegal instructions will get passed into the 
execution pipeline and cause an exception. 

-faQ2-4-}-r0029] For .. purposes .of illustration, .an exemplary 

operation may include the following steps. Memory 51 is 
assumed to contain the following three machine instructions: 
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Inst#0 

DMULTU rs, rt 
Inst#2 

where "lnst#0" and "Inst#2" represent any instruction and 
"DMULTU rs, rt" represents an instruction to multiply 64-bit 
sized operand values stored in registers rs and rt to obtain a 
128-bit si7.f=»d result. (The DMULTU function is supported by 
the MIPS 64 5K™ Processor as described in the MIPS 64 
Processor Core Family Software User's Manual/ dated 2001). 
Rather than incurring the expense of providing large-value 
multiplier hardware with the processor, a processor in 
accordance with the present invention is able to emulate the 
DMULTU instruction, without changing the original 
instructions, by using a standard, less expensive 32-bit 
multiplier. 

[0025] [0030] When the DMULTU instruction is transferred from 
memory 51 into instruction cache 52, it is first passed into 
the system 100 (FIGURE ^3, step 201). For purposes of 
illustration, it shall be assumed that all opcodes of the 
system are 6 bits long and that DMULTU is represented by the 
binary value "011101". 

[0026] [0031] The instruction is simultaneously passed to the 
address generator 102, one of the inputs to the instruction 
selector 108 and the complex instruction detector 106. 
[0027] [0032] Address generator 102 then determines the 
address for the instruction based on the opcode (and 
potentially some or all of the operand) for the instruction 
word "DMULTU" (step 202) The address may identify computer 
instructions for emulating the complex computer instruction 
(FIGURE 3(a), substep 222), 

[002 8 ] [0033] Accordingly, an address having a value 
hereafter designated as emu_DMULTU will be determined and 
passed to jump instruction generator 104 (step •2-&5202) . Jump 
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instruction generator 104 then appends the opcode value for 
Jump and Link to the address from the address generator 102 to 
create the instruction "JAL emu_DMULTU" (step 3^203) The 
JAL instruction is then provided to the instruction selector 
108 (step 204) , 

[002D] [0034] The original instruction "DMULTU rs, rt" is 
also passed to the complex instruction detector 106. Detector 
106 tests the Instruction to see whether it is corr.pl ex or not 
(step 205) . As seen in FIG. 3(a)^ step 205 may include 
determining whether the computer instruction generates a 
reserved instruction exception (substep 220) . It shall be 
assumed that DMULTU would test positive as a complex 
instruction and, accordingly, a bit value of 1 is provided by 
detector 106 to instruction selector 108. 

[0030] [0035] Upon receipt of the various inputs, instruction 
selector 108 selects the instruction directly from memory if 
the bit value of the detector 106 is 0 (step 208) and selects 
the instruction from generator 104 if the bit value of the 
detector 106 is 1 (step 206) . Using the foregoing example, 
the bit value is 1, so the instruction "JAL emu_DMULTU" would 
be sent to the instruction cache instead of "DMULTU rs, rt" 
(step 207, 209) . This will cause the processor to jump and 
link to the code at address emu_DMULTU, which uses the 
following code for a 64-bit multiplier to emulate the code for 
a 128-bit multiplier. It makes 64-bit partial products and 
adds them to get a 128bit product. The operation is 
HI (63:0) |LO(63:0) < rs(63:0) * rt(63:0): 



Ernu_DMULTU: 

MULTU rs, rt 

MOV tls, rs 

MOV tit, r"t 

DSRL32 ths, rs, 0 

DSRL32 tht, rt, 0 

MFLO xl 

MFHI xh 



HIILO <- rs(31:0) *rt (31:0) 
tls <- rs 
tit <- rt 

ths(31:0) <-rs(63:32) 
tht (31:0) <-rt(63:32) 
xl <- LO 
x2 <- HI 



DSLL32 


xh, 


xh, 


0 


; xh(63 


:32) <- xh(31:0) 




OR 


xl. 


xh. 


xl 


; xl <- 


result of rs (31:0) *rt (31: 


0) 


MULTU 


ths. 


rt 




; HIILO 


<- rs(63:32) *rt (31:0) 




MFLO 


xl 






; xl <- 


LO 




MFHI 


xh 






; x2 <- 


HI 




DSLL32 


xh. 


xh. 


0 


; xh(63 


:32) <- xh(31:0) 




OR 


x2, 


xh. 


xl 


; x2 <- 


result of rs (63:32) *rt (31 


:0) 


MULTU 


rs, 


tht 




; HIILO 


<- rs(31:0)*rt (63:32) 




MFLO 


xl 






; xl <- 


LO 




MFHI 


xh 






• x2 <- 


HI 




DSLL32 


xh, 


xh. 


0 


• xh(63 


:32) <- xh(31:0) 




OR 


X3 r 


xh. 


xl 


• x3 <- 


result of rs (31:0) *rt (63: 


32) 


MULTU 


ths. 


tht 


? HIILO 


<- rs(63:32)*rt (63:32) 




MFLO 


xl 






; xl <- 


LO 




MFHI 


xh 






? x2 <- 


HI 




DSLL32 


xh, 


xh. 


0 


; xh(63 


:32) <- xh(31:0) 




OR 


x4, 


xh. 


xl 


• x4 <- 


result of rs (63:32) *rt (63 


:32) 


ADD 


xm, 


x2, 


x3 


] xm <-x2 + x3 




nSLL3 2 


xmx f 


xm, 


0 


• xmx(63:32) <- xm(31:0) 




ADD 


xl, 


xmx, 


xl 


• xl <-xmx + xl 




MTLO 


xl 






• LO <- 


xl 




ADD 


x4. 


xmx. 


x4 


• x4 <- 


xmx = x4 




MTHI 


x4 






■ JO <- 


x4 




JR 


r31 






Link Return 





Accordingly, the foregoing example illustrates how the complex 
instruction DMULTU may be quickly emulated by other 
instructions. 

[0031] [0036] The present invention provides a number of 
advantages. It allows complex instructions to be emulated by 
jumping to simpler sets of instructions in a quick manner 
without complex decoding. Rather than waiting to see if an 
instruction is complex or not, the address is extracted and a 
Jump instruction is created. The instruction is also 

simultaneously- tested for complexity. - If it is^ complex, - the 
Jump Instruction to the emulation code is ready and waiting. 
If it is not, no time is lost because the instruction would 
have been tested for complexity anyway. 
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[0032] [0037] The invention is also advantageous in that it 
permits different implementations which can vary with the 
needs of the application. For example, address generator 102 
may use more than just the opcode to determine the address of 
the emulated code. For some or all of the opcodes, it may use 
the operands as well, such that the same complex instruction 
may jump to different addresses and different emulation code 
He^oenrjtnQ on the arguments used by the instruction- For 
example, as seen in FIGURE 5(c), the address generator 102 may 
process machine instructions 130a and 130b and may jump the 
two instructions to different addresses 136a and 136b, as seen 

by lines 138a and 140, respectively. Alternatively, the 

address generator 102 may jump all complex instructions to the 
same address, thus leaving it up to the emulation code to 
decode the complex instruction and determine the details of 
the opcode and operands (if any) • As shown in FIGURE 5(c) , 
the address generator 102 may process the machine instructions 
130a and 130b and may jump them to the same address, address 
136b, as seen by dashed line 138b and line 140, respectively. 
Accordingly, one or the other or a combination of both methods 
can be used to fit the needs of the chip and the chip's 
application. If there are a limited number of complex 
instructions, then each instruction may go to a unique 
address. If there are large number of complex instructions, 
then nearly all or all of complex instructions may be jumped 
to the same address. 

[0033] [0038] For example, in connection with the DMULTU 
instruction above, it is preferable for the address generator 
102 to decode some or all operands and generate an address 
according to the operands. This will ensure that the JAL 
.instruction -j-umps to emulation code which is capable of 
determining the source and destination registers defined by 
the operands. More specifically, some of the upper bits of the 
output address may be generated according to the opcode, and 
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lower bits may be generated based on the operands. 
Alternatively, the instruction could be completely decoded by 
software after the address generator 102 generates an address 
(such as by subtracting four from the link register) . 
[0034] [0039] Another advantage of the invention arises out 
of using the reserved instruction exception in connection with 
the complex instruction detector 1.06, This allows the complex 
i n s t r uc^ "i '^Ti cie tec-tor 106 to actually perform tv.'o functions; 
determining' whether the instruction is actually illegal code 
or, instead, a complex code. If it is a complex code, then it 
is emulated- Otherwise, it is treated as a real error. The 
reserved instruction exception can also be implemented in the 
address generator 102 by checking opcodes not only for 
complexity, but also for validity . 

[0035] [0040] Moreover, all or part of the decoding of 
complex instructions is done by hardware without degrading the 
performance of the instruction execution pipeline because the 
instruction can decoded and replaced with a Jump and Link 
instruction between the memory system and a cache outside of 
the execution pipeline. In the case of using a reserved 
instruction exception, the instruction is decoded by software 
only. Even so, because the invention replaces a complex 
instruction to a Jump and Link instruction, the hardware 
implementation is relatively simple and it does not require 
changes to the execution pipeline . 

[0036] [0041] Unless stated to the contrary, any use of the 
words such as "including," "containing," "comprising" and the 
like, means "including without limitation" and shall not be 
construed to limit any general statement that it follows to 
the specific or similar items or matters immediately following 
it. References to. -a "plurality" of things means at least two- 
of the things, and, unless stated to the contrary, reference 
to "a" or "one" thing does exclude the possibility of using a 
plurality of such things. 



[0037] [0042] Most of the foregoing alternative embodiments 
are not mutually exclusive, but may be implemented in various 
combinations to achieve unique advantages. As these and other 
variations and combinations of the features discussed above 
can be utilized without departing from the invention as 
defined by the claims, the foregoing description of the 
embodiments should be taken by way of illustration rather than 

HvV7 ui r\-F "l-im-i+'ra-f-T r^r^ r\-f +- K <=s -i -r* ^tcm^ +- -i /^-t^ e- "* ^ S d b V t h S 

claims . 
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